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@ A method of genetically evaluating animals by using mitochondrial DNA is disclosed. Polymorphisms in 
mitochondrial DNA are detected by isolating, fragmenting, and sequencing the DNA. The restriction patterns and 
nucleotide sequences of mitochondrial DNA of different animals are correlated to expressed traits in the animals. 
This may be confirmed by comparing results to expression of the trait In maternal lineages of animals. Further, 
effects of maternal lineages are determined by partitioning maternal genetic variation from nuclear variation. 
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BACKGROUND OF THE INVENTION 

In genetically evaluating animals, researchers use theory based on nuclear genes. Higher organisms 
have a nucleus within their cells that contains nuclear genetic material. This has been the source of genetic 
5 material for scientists in analyzing genetic background and inheritance of animals. However, there are other 
organelles within the cells that have their own genetic makeup, one of which includes the mitochondria with 
its own DNA. 

Until now. it was widely held that essentially no variation existed in mitochondrial DNA. This invention 
relates to the finding that there is variation, or polymorphism, within mitochondrial DNA, and these 

w polymorphisms may be used to evaluate inheritable traits of animals. 

The use of mitochondrial DNA is a significant advancement in the science of animal breeding. Current 
genetic improvement programs are based solely on use of nuclear genes. Classical animal breeding 
methods have been developed first by using mass selection and then by progressing to evaluation of 
lineages, pedigrees, selection index, and best linear unbiased prediction methods. Progeny testing is the 

75 breeding method used most often in cattle. All methods are based on use of nuclear genetic material. 
Mitochondrial genes are inherited maternally. That is. they are transmitted from a female to both sexes of 
progeny; males do not transmit mitochondrial genes to their progeny. Thus, in a progeny test, bull mothers 
pass their maternal genetic material to their sons, but these sons do not transmit this genetic material to 
their daughters. Both nuclear and mitochondrial genetic material contributes to the production records of 

20 bull mothers, but. because only nuclear material is transmitted through their sons to his daughters, the bull 
mothers may be over- or under-evaluated, depending upon whether the bull mother's mitochondrial 
contribution is negative or positive' If the magnitude of the cytoplasmic or mitochondrial effect is known, the 
pedigree estimate of the bull mother's contribution can be adjusted appropriately. 

Milk volume and percentage of milk fat and milk protein (two constituents of milk, along with lactose 

25 and minerals) are economically the most important traits of dairy cattle. Reproduction and health of cattle, 
however, merit attention as well, because the interval between successive calvings and health costs also 
determine profitability of dairy cows. Effects of cytoplasmic inheritance on reproductive measures have 
been shown for number of days open (days from calving to next conception), days from calving to first 
detected estrus, first service conception rate, and number of services. 

30 It has now been found that there are polymorphisms in mitochondrial DNA and that a method of 
isolating and preparing specific fragments of the mitochondrial DNA has been developed so that fragments 
in the mitochondrial DNA may be associated with phenotypic expression as a particular trait In the animal. 
This discovery has been applied specifically to a herd of dairy cattle by associating the presence of 
particular nucleotide substitutions in the D-loop region of mitochondrial DNA (mtDNA) with increased 

35 volume productions of milk and with greater milkfat content. Other associations are also made for important 
economic traits such as days open and certain health traits such as mastitis and reproductive problems. 

Mitochondrial lineage influences on health differences of cattle have not been examined. Much work 
associating human diseases to mitochondrial DNA sequence differences has been reported. Kearns-Sayres 
Syndrome (KSS) and Leber's hereditary optic neuropathy (LHON) are examples of such diseases. In fact, 

40 LHON has been shown to be correlated with a single guanine-adenine transition that converts an arginine to 
a histidine in NADH dehydrogenase subunit 4 gene of mtDNA. Such mtDNA nucleotide sequence 
substitutions may affect health and other production traits in cattle. 

Molecular variation In bovine mtDNA has been demonstrated through RFLP analysis and comparison of 
nucleotide sequences. Displacement loop (D-loop) sequences of mtDNA from 36 distinct registered 

45 maternal lineages available for this study were previously compared Lindberg. G.L. 1989. Sequence 
heterogenity of bovine mitochondrial DNA. Ph.D Dissertation. Iowa State University, Lib. Ames. University 
Microfilm order No. DA-9014925. Ann Arbor, Ml. Fifty-one sequence differences were located, including 48 
single base pair (bp) substitutions, one 9-bp deletion, and two variable length poly G-C runs. Where 
possible. D-loops from two or more animals of the same maternal lineage were sequenced to verify 

so accuracy of mtDNA isolation and nucleotide sequencing and to confirm constancy of mtDNA within 
maternal lineages, supra. 

The D-loop region of mtDNA does not code for any known gene products; hence, sequence polymor- 
phisms there would not alter specific protein chain subunits. Promoters for transcription of both heavy and 
light strands of mtDNA as well as the origin of heavy strand replication, however, tie within the D-loop. Thus, 
55 sequence differences In the mtDNA D-loop may alter transcription or replication rates. Moreover, such D- 
loop polymorphisms may serve as markers of differences elsewhere on the mtDNA genome in coding 
regions of genes that are associated with phenotypic expression of traits. 
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The primary objects of the invention are, therefore, to evaluate animals genetically by using their 
mitochondrial DNA by phenotypic analyses of differences between maternal lineages. 

Another object of the invention is to use polymorphism in mitochondrial DNA of animals to associate the 
absence or presence of single or several markers with traits of economic importance including health trait 
5 importance in animals. 

A further object of the invention is to determine the sequence of nucleotides in mitochondrial DNA 
fragments and associate presence or absence of specific nucleotide sequences of fragments with traits of 
economic and health importance. 

A still further object is to associate polymorphism in mitochondrial DNA with its expression in the 
70 maternal lineage of an animal. 

A further object is to provide for improved genetic evaluation of animals by phenotypically determining 
the relative worth of maternal lineages for economic traits. 

Another object of the invention is to predictably Isolate efficient milk producers and lineages of milk 
producers from dairy herds by evaluation of polymorphism in mitochondrial DNA. 
75 Other objects of the invention will become apparent in the following disclosure. 

SUMMARY OF THE INVENTION 

The invention relates to the use of cytoplasmic genetic material, specifically the mitochondrial DNA, to 
20 genetically evaluate an animal. The mitochondrial DNA of an animal is isolated, fragmented and sequenced, 
and polymorphism within the mitochondrial DNA Is detected. This polymorphism is then correlated with the 
presence or absence of a desired trait. This relationship may be further confirmed by comparing the lineage 
of animals showing polymorphisms with lineages not showing the polymorphisms. Additionally, values of 
maternal lineages can be determined on a phenotypic basis to improve accuracy of bull mother selection 
25 and choice of females as donors for embryo transplants. 

DETAILED DESCRIPTION OF THE INVENTION 

Mitochondria are the source of cytoplasmic genetic Information. Mitochondria carry multiple copies of a 
30 circular genome that Is replicated and expressed within the organelle and is inherited maternally. It is the 
only genetic element known to be inherited cytoplasmically in mammals. Mitochondrial DNA In animals 
codes for 13 polypeptides that have been identified as components of the ATP synthesis system and codes 
for all of the transfer RNAs used in mitochondrial protein synthesis. The only non-coding portion of the 
genome is a region of approximately 900 base pairs that is referred to as the displacement loop or "D- 
35 loop". This D-loop is involved in the control of transcription and replication of mitochondrial DNA. Variation 
in mitochondrial DNA will affect the size of the mitochondrial DNA population in the cell, abundance of 
mitochondrial DNA gene transcripts and translation products, and mitochondrial oxidative energy transduc- 
tion capacity. 

As discussed more fully below, this invention involves the discovery that polymorphisms within the 
40 mitochondrial DNA of mammals can be associated with expressed traits in mammals. 

Mitochondrial DNA is first obtained from the blood of the animal. A variety of methods exist for isolating 
mitochondrial DNA, such as Lindberg, G.L, Koehler, CM., Mayfield. J.M., Myers, A.M., and Beitz, D.C. 
1992 Blochem. Genet. 30:27, and incorporated herein by reference. One procedure is described in more 
detail In the example below. 

45 After isolating the mitochondrial DNA, it then can be cloned, and typically this is done in Escherichia 
coli cells. This cloning must be done because the quantity of DNA required is usually more than can be 
practically collected from blood samples, and also because individual dairy cattle are not maintained 
permanently in the herd. This cloning allows for a permanent source of mitochondrial DNA from each 
animal. Cloning involves segmenting the mitochondrial DNA by using restriction endonucteases and then 

50 ligating enzymatically this segment to a vector by commonly used methods such as those described by 
Hackett, et aL, P.B., Fuchs, J.A., and Messing, J.W. ( An Introduction to Recombinant DNA Techniques , 
(Benjamin/Cummlngs Publishing Co., 2nd Ed. 1988). The recombinant constructs then are introduced into 
the Escherichia coli cells for propagation and amplification of the recombinant plasmid. This will yield an 
amount of the targeted mitochondrial DNA sufficient for restriction fragment length polymorphism analysis. 

55 Another method used in one laboratory for amplification of mitochondrial DNA is by the polymerase 
chain reaction (PGR). We have Isolated, amplified by . the PGR. and then determined the nucleotide 
sequence of the PGR-amplified mitochondrial DNA. Even though this procedure allows one to obtain 
nucleotide sequence data, only the previously described cloning procedure allows one to permanently store 
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the mitochondrial DNA from test animafs. 

Foltowing cloning, or using directly isolated mitochondrial DNA. the mitochondrial DNA then is 
fragmented. One method by which this may occur is by digesting the mitochondrial DNA with selected 
restriction enzymes. A number of different kind of enzymes are employed to obtain an adequate screening. 

5 These enzymes will fragment the DNA at particular selected sites, thereby resulting in fragments of known 
components. The restriction endonucleases are chosen to cause multiple cleavages within the target DNA 
and to create an optimal population of differently sized cleavage fragments capable of being resolved by 
gel electrophoresis. The number of restriction endonucleases employed will vary depending upon the size 
of the population to be screened and, in the example, below reflects the use of at least 10 restriction 

10 endonucleases. Restriction digests are conducted according to the specification of the supplier. The use of 
restriction endonucleases to measure mitochondrial DNA sequence relatedness is known and has been 
discussed, for examfDie, by Aviso, J.C., Lansman, R.A., and Shade, R.O.. "The Use of Restriction 
Endonucleases to Measure Mitochondrial DNA Sequence Relatedness to Natural Populations". Genetics 92: 
279 (1979). Examples of restriction endonucleases that are useful in characterizing bovine mitochondria are 

75 listed below. These are only examples; other endonucleases can be used. In general, these will produce 
between 5 and 25 fragments that are larger than 100 base pairs. See, e.g.. Anderson, S., deBruyjn, M.H., 
Coulson, A.R., Eperon. i.C, Sanger, F. and Young, I.G., J. Mol. Biol.. 156^683 (1982). Although by using 20 
different enzymes, a screen could be extended to cover about 220 recognition sites, far less extensive 
screens such as those discussed below in the example may be employed with satisfactory results. 

20 

TABLE 1 



Restriction Endonucleases 


Accl 


(lor 


Fokl 


(21) 


Sau96 


(12) 


StuI 


(9) 


Ahalll 


(11) 


Haelli 


(29) 


Seal 


(9) 


Styl 


(10) 


Banii 


(12) 


Hindi 


(10) 


ScrF1 


(14) 


TaqI 


(21) 


Bbvl 


(10) 


Hpail 


(16) 


SfaNI 


(21) 


Tthlll 


(24) 


BsmI ' 


(10) 


Hphi 


(19) 


Spel 


(8) 


Xbal 


(6) 


EcoRII 


(9) 


Sau3A 


(19) 


Sspl 


(14) 


Shol 


(5) 


Fnu4H1 


(15) 















The number in parenthesis is the number of expected fragments from the 
complete mtDNA that will be larger thann 100 base pairs. 



35 

Fragments resulting from digestion with restriction endonucleases are then subjected to agarose gel 
electrophoresis. This process is widely known to those skilled in molecular biology. Briefly, the fragments 
are first end-labeled with radioactive nucleotides and then are placed in a gel of a substance which provides 
■ a homogenous matrix, for example, agarose or polyacrylamide. The gel is permeated with a buffered 

40 aqueous solution, and the fragments move in an electrical field towards the positive pole, depending on 
their net charge. This analysis will reveal nucleotide sequence heterogenity at random sites throughout the 
bovine mitochondrial DNA. This is useful in analysis of the mitochondrial DNA. 

Any observed polymorphism can be verified by sequencing the nucleotides involved in the polymor- 
phism and observing sequence heterogeneity. Sequencing nucleotides of DNA can be performed by the 

45 dideoxy method described by Sanger, et al. 1977. [Spanger, F.. S. Nicklen, and A.R. Coulson. "DNA 
Sequencing With Chain-Terminating Inhibitors", Proc. Natl. Sci. USA, 74:5463 (1977)]. The presence or 
absence of a polymorphism then is compared with the phenotypic expression of a trait. Obviously, there are 
many methods for associating a particular trait with the sequenced DNA. The concept is to compare the 
maternal lines that contain or do not contain the particular genetic marker. If, for example, of 38 cow lines 

50 analyzed. 28 have the marker and the rest do not. one would then attempt to associate a particular trait with 
the presence or absence of that marker. This is then confirmed by looking at the phenotypic expression of 
the various lines. 

An animal model, which is a mixed linear model. Is used to estimate the effects due to maternal 
lineages. Effects are classified as either fixed (repeatable) or random (not repeatable) and are simulta- 

55 neously estimated. The fixed effects have properties of Best Linear Unbiased Estimates, and the random 
effects are Best Linear Unbiased Predictions. In this case maternal lineages have been classified as fixed 
when lineages were determined by tracing cows to their origin in the Holstein, Friesian Herdbook of 
America, and also classified molecular polymorphism or sequences as fixed. 
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DESCRIPTION OF EXPERIMENTAL ANIMALS 

Cows in this study were from a selection experiment founded at Iowa State University iri 1968. Heifers 
for this herd were purchased form 38 Holstein breeders throughout Iowa to keep the herd as genetically 

5 broad-based as possible. Cows were bred artificially to sires from commercial artificial ir^semination 
organizations, allowing a continuous Influx of nuclear genes. Frequencies of bovine lymphocyte antigen 
phenotypes were similar to frequencies in the U.S. Holstein population, meaning that these nuclear genes 
are likely representative of the entire U.S. Holstein population. Females were assigned to groups and 
artificially mated to bulls with either high or average estimated additive genetic transmitting ability for milk 

70 yield. Females born in each group were mated to new bulls chosen for that group, thus forming divergent 
selection lines that differed by 1308 kg of milk per cow per lactation when these data were analyzed. 

Ancestral pedigrees of registered foundation females in the herd were tracked backward through the 
Holstein- Friesian Herd Book. Eighty-one distinct maternal lineages were defined by convergence of 
maternal pedigrees after 1885. It is possible that these lineages would have been found to converge to 

76 fewer lineages had registration records been kept prior to importation of these cows from Europe in about 
1885. 

Thirty-six maternal lineages had surviving members in the herd when samples for nucleotide sequen- 
cing were collected. These lineages had from one to six purchased foundation females in the herd. 
Nucleotide sequence polymorphism data were obtained and all cows within the same maternal lineage that 
20 were ever in the herd were assumed to have identical mtDNA. Because of the among-lineage emphasis of 
the comparisons in the clonal nature of propagation of mitochondrial ONA. each lineage was represented by 
at least two cows. 

Isolation of Mitochondrial DNA 

25 " ' 

Total leukocytes were isolated from 400 ml of anticoagulated blood, obtained by jugular venipuncture, 
by low-speed centrtfugation after erythrocyte lysis with 140 mM ammonium chloride, pH 7.4. Leukocytes 
were lysed with 1% wtA/ol Triton X-100 in the presence of 1% wtA^ol sodium dodecyl sulfate. Nuclei and 
cell membranes were separated by the cytosolic fraction by centrifugation at 12,000 x g for 5 min. Soluble 
30 proteins were extracted from the resulting supernatant with 1:1 phenolxhioroform, and nucleic acids were 
precipitated from the aqueous phase with 3 vol. ethanol. This procedure yielded a mixture of supercoiled 
and relaxed covalently closed and nicked circular mitochondrial DNA molecules, RNA and only trace 
amounts of nuclear DNA contaminants (Koehler et al., 1988 as cited earlier). 

35 Selection and Cloning 

The region of the mitochondrial DNA molecule that includes the D-loop, which is markedly variable 
across mammalian species and in which mutations had been detected previously in Holstein cattle, was 
chosen as the target of the screen. The segment of the mitochondrial DNA molecule from the PstI 

40 restriction endonuclease cleavage site at nucleotide (nt) 15.738 to the Sstr cleavage site at nt 3,684 was 
excised and ligated enzymatically into the multiple cloning site of the 3.2 kilobase phagemid cloning vector 
pUC118. This segment constituted one fourth of the mitochondrial DNA molecule. The recombinant insert 
DNA was 4.284 base pairs (bp) long and included a 52-bp tRNA'*'*' gene, the 910-bp D-loop region, the 66- 
bp tRNA^~ gene, the 954-bp 12S rRNA gene, the 66-bp tPNA^*' gene, the 1.570-bp 16S rRNA gene, the 

45 74-bp tRNA'"'*" gene and a 592-bp portion of the first mitochondrially coded subunit of the nicotinamide 
adenine dinucleotide dehydrogenase complex (ND 1 gene). 

Recombinant constructs were introduced into Escherichia coli TG-1 cells (K12; hsdD5, supE, thi, 
[lacpro]/f':traD36, proA + B + , lad'', lacZ M15) by the calcium-heat shock method (Mandel, MJ. and Higa, A., 
"Calcium Dependent Bacteriophage DNA Infection," J. Mol. Biol.. 53:154 (1970)). Transformation was 

50 confirmed by insertional inactivation of vector alpha complementation of isoprdpyl-B-D- 
thiogaiactropyranoside-induced B-gatactosidase activity in ampicillin-resistant colonies. Amplified recom- 
binant plasmids were Isolated from 50-ml Luria broth cultures of transformed bacteria by alkaline lysis 
(Birnboim. H.C. and Doly, J.. "A Rapid Alkaline Extraction Procedure for Screening Recombinant Plasmid 
DNA", Nucleic Acids Res. . 7:1513 (1979)) to yield an amount of the target mitochondrial DNA sufficient for 

55 restriction fragment length polymorphism (RFLP) analyses. 
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EXAMPLE OF ESTIMATION OF MATERNAL LINEAGE EFFECTS 



Cows in our reseach herd were traced by maternal lineages to foundation femals in the Holstein- 
Friesian Herdbook to establish maternal lineages. Solution for maternal lineages were obtained from the 
5 following animal model. 

Yijwmno = U + YSi + P] + MU + LI + G(L)^:, + PE„ + A„ + eiiklmno. 

Effects in the model are as previously defined except U is the "high" or "average" sire selection line; G(L)- 
70 m:i is the mth sire birthyear group nested in the 1th selection line; PEn is the permanent environmental effect 
of the nth animal with a record; and An is composed of breeding values of sire and dam and a segregation 
effect and is tthe additive genetic effect of the nth animal . 

Because animal models are usually large, conventional tests of significance, requiring elements of 
variance-covariance matrices and, hence, direct inversion, are often not feasible. A test of significance using 
75 . mixed model conjugate normal equations was developed to test significance of maternal lineage effects. 
Variance ratios for random effects in the model were calculated from results of our variance component 
estimation. Maternal lineages were evaluated as fixed effects in this animal model including random animal 
and permanent environment effects. Ranges of maternal lineage estimates were 2934 kg milk, 1 54 kg fat. 
and .907 percent fat (Table 1) Maternal lineages, significantly affected kg fat and fat percentage. (Table 3) 
20 Maternal lineages also affected calculated net energy of milk, but were not apparently important for solids- 
not-fat yield or concentration, (not shown in Tables) 

Table 2 



Ranges of estimates or solutions of maternal lineage effects from animal model analysis 


Trait 


Maternal Lineage Range 




MEMILK2 


2934 


1514 


MEFAT3 


154 


59 


FAT %* 


.907 


.393 



^ With herd phenotypic standard deviation. 

2 Mature equivalent, 305-day milk yield. 

3 Mature equivalent, 305-day milk fat yield. 
35 * Milk fat percentage. 



TABLE 3 

40 : 





Trait^ 


F 


P> F 


Fixed sire groups 


MEMILK 


1.17 


-188 




MEFAT 


1.47 


.017 




FAT % 


1.71 


.001 


Westell groups 


MEMILK 


1.09 


.308 




MEFAT 


1.39 


.035 




FAT % 


1.82 


.001 



' For abbreviations, see Table 2. 



EXAMPLES OF RELATING MOLECULAR EVIDENCE TO TRAITS OF ECONOMIC IMPORTANCE 

55 Table 4 has location, type, and frequency of the 17 most common sequence polymorphisms of the 
mtDNA D-ioop in this herd of Holstein dairy cattle. Only those polymorphisms occurring in at least 4 
percent of cows in the herd are listed, because information on markers occurring in a very small number of 
cows would not be statistically informative. Transitions at bp 169 and 216 occurred in 80 and 84 percent of 
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cattle, respectively. The probable explanation is that the cow originally sequenced, Anderson, S,, M.H.L. 
deBruijn, A.R. Coulson, I.C. Eperow, F. Sanger, and KG. Young 1982, Complete sequence of bovine 
mitochondrial DNA. J. Mol. Biol. 156:683. had the rarer genotype at those two sites. From 30 to 608 cows 
were polymorphic at individual bp sites pf the least and most frequent mtDNA D-toop sequence difference. 
5 respectively. 

Production records of all cows in the 36 maternal lineages with known mtDNA 0-loop sequences were 
considered. Milk and fat yield records adjusted to a uniform age and lactation length, or mature equivalent 
(ME), basis were obtained. Percentages in milk of fat and solids-not-fat (SNF) which is total solids in milk 
less fat in milk, were known for each record of each cow. Up to seven production records were used for 

10 individual cows. 

Because mitochondria play an extensive role in energy metabolism, mtDNA polymorphism may alter 
energy content in milk. Fat. protein, and lactose are the carriers of energy in milk; however, information was 
complete since 1968 only for fat and solids-not-fat, which combines protein, lactose, and minerals. Net 
energy concentration in milk, which is based on fat and SNF, was calculated in terms of kilocalories per 
75 kilogram of milk and multiplied by lactation milk yield to approximate lactation energy production in terms of 
megacalories. 

To evaluate the effect of mtDNA D-loop sequence polymorphism, each cow was assigned a value of 1 if 
polymorphic or 0 if not polymorphic with respect to the first published mtDNA sequence at each of the 17 
locations considered. Each production trait was analyzed individually with the following animal model: 

20 

Yijknp-a + YS| + Pj + Xk + i9i + ... + /3i7 + PEn + an + Op. 

where Yijk„p is the milk, fat. fat percentage. SNF, SNF peircentage. energy concentration, or lactation energy 
record; u is an overall mean; YSj is the effect common to all cows calving in year-season; Xk is the effect 

25 common to cows in either the high or average selection line; ^i to are the binomial regressions of 
production record on mtDNA D-loop sequence polymorphisms; PEn is permanent environmental effect 
common to all records of cow n; an is effect of animal n and is composed of the additive genetic 
contribution of sire and dam breeding values and a Mendelian sampling effect; and ep is a random residual. 
Regression of production traits on mtDNA D-loop polymorphism was of primary interest. Effects of the 

30 overall mean, year-season of calving, parity, and selection line were treated as fixed effects in the mixed 
model to account for explainable environmental background. Additive genetic covariances among related 
individuals were Incorporated in this model. Permanent environment and additive genetic effects were 
treated as random and have properties of Best Linear Unbiased Prediction (BLUR). Random effects were 
assumed to be normally and independently distributed with mean expectations of zero. Variance among 

35 permanent environments was assumed to be 

var(PE) « 1 o^/' 2 , 
40 where 1 is an identity matrix and is error variance. Variance among animals was assumed to be 

var(a) = ho\/ 

45 where A is the numerator relationship matrix, Henderson, C.R. 1984. Applications of linear models in animal 
breeding. University of Guelph. Guelph, Canada. A included sires and dams of all cows back to foundation 
cows and included information for sires and paternal grandsires of bulls with daughters in the herd. 
Regression coefficients for sequence polymorphism are Best Linear Unbiased Estimates and were obtained 
by iterative methods described elsewhere, M.M. Schutz, A.C. Freeman, D.C. Beitz, and J.E. Mayfieid, J. 

50 Dairy Sci. (In Press) (1992). incorporated herein by reference. 

A total of 1800 records of 728 cows were used in this study. Along with effects of 17 mtDNA locations, 
there were 33 year-season, 7 parity, 2 selection line, 728 permanent environment, and 950 animal effects. 
Animal, or additive genetic, effects were for 728 cows with records, 197 relatives without records, and 25 
phantom parent groups to account for genetic similarities among animals without relationship or production 

55 information. 

Table 5 has overall means of production traits along with their regressions on nucleotide sequence 
differences. The polymorphism at base pair (bp) 363 has previously been associated with milk and fat yield 
and fat percentage. This site has since been shown to be highly heterogeneous within maternal lineages, 
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and is therefore unstable for use as a marker. Regressions of production on the polymorphism at bp 363 
will not be reported because the sequence difference can not be assumed uniform within maternal lineages. 

Eleven nucleotide sequence polymorphisms significantly influenced at least one production trail and 
traits were influenced in both positive and negative directions, A single Adenine to Guanine transition at bp 

5 169 related to increased production of 482 kg of milk. 24 kg of fat, 51 kg of SNF, and 456 megacalorles of 
energy per cow per lactation. On a purely evolutionary basis, site 169 previously has been found to 
demarcate two distinct mitochondrial families of cattle in the Holstein population, Lindberg, G.L. 1989. 
Sequence heterogenlty of bovine mitochondrial DNA. Ph.D. Dissertation. Iowa State Univ. Libr. Ames 
University Microfilm No. DA-9014925 Ann Arbor, Ml. 

10^ Polymorphism at site 18074 had a large positive effect on milk, fat. and SNF yield and lactation energy, 
while polymorphism at site 16231 had a negative effect on the same traits. Fat percentage in milk and 
energy concentration of milk were significantly affected in cows with polymorphism at sites 16058. 16085. 
16230, and 16247. Effects of polymorphism on SNF percentage were small. Possibly, larger effects would 
have been observed for lactose and protein if data for those components were individually available. Effects 

75 as large as these found are certainly economically important. 

An overall test of significance was used to determine whether effects of D-loop polymorphisms were 
specific for Individual locations. That the effect of a D-loop sequence polymorphism being present differs 
among bp sites was tested versus the null hypothesis that the effect of the presence of a polymorphism at 
one site equals the effect at any other site. Effects of presence of D-loop polymorphism at different sites 

20 were significant for milk fat yield {P> F <.06). 

Reproductive and health traits also have economic importance in dairy cattle Improvement programs. 
Table 6 has regressions of reproduction and health costs on sequence polymorphisms In the mtONA D- 
loop. Number of days open and number of breedings (artificial Inseminations), along with reproductive 
costs, were the measures of reproduction considered. Reproductive costs included costs of insemination 

25 and semen, as well as costs of reproductive exams and treatments for post-calving disorders such as 
metritis and retained placenta. 

In addition to reproductive costs, health differences were measured by mammary costs and total health 
costs. Using costs of health disorders was necessary because incidences of specific events or diseases 
occur too infrequently to be of use in analysis of data from a single herd. Health costs were grouped by 

30 body systems and total health cost Is the sum, of reproductive, digestive, mammary, respiratory, and skin 
and skeletal costs. Mammary costs reflected costs of treatment and medications required for cows with 
mastitis or injured teats, but discarded milk value was not Included. 

Polymorphism at bp 1 69 was associated with days open, and reproductive costs and days open were 
increased in animals polymorphic at bp 16058. There was a very large favorable impact on the reproductive 

35 complex In cows with mtDNA D-loop sequence difference at bp 16085. This single T to C transition was 
related to a decrease of 36 days open, one Insemination, and $12.82 in reproductive costs. Thirty-six days 
open is nearly 2 reproductive cycles. 

Mammary costs were altered in cows with mtDNA D-!oop sequence polymorphisms at four locations. 
None were locations significantly affecting milk or milk fat yield or fat percentage. Total health costs were 

40 increased when polymorphisms at bp 106 and 16074 occurred, and decreased when polymorphisms at bp 
216 occurred. Significant effects were not observed for costs of digestive, respiratory, and skin skeletal cost 
categories. Incidences of health disorders in these systems were sparse. 

Current dairy cattle breeding programs are centered around selection of bulls used for artificial 
insemination. Two-stage selection is practiced for bulls entering artificial insemination organizations. The 

45 first stage is pedigree selection, which is based on sire and dam information. Bulls are finally chosen for 
extensive use based on a progeny test. The largest selection differentials are for the sire to bull pathway 
where mtDNA polymorphism Is not Important If the mitochondrial genome is transmitted only from female 
parents. The dam to bull pathway is equally important, but the accuracy of selection is less than at the sire 
to bull pathway. A bull's estimated transmitting ability based on pedigree may be biased if the contribution 
50 from his dam is not adjusted for mitochondrial influence on her records. While the bull would acquire 
mtDNA from the dam, it would not be transmitted to his offspring. Adjustment of bull dam's records for 
mtDNA influences would allow more accurate prediction of expected genetic contribution of a bull to his 
daughters. 

Data from our experimental herd can be used to demonstrate the magnitude of adjustments that could 
55 be expected for bull mothers. Over 4^ years we had 41 cows from 19 different lineages that were In the top 
2% of cows In the United States, which qualifies them as potential bull mothers. The top 2% was imputed 
by USDA-AIPL. The range of maternal lineage effects for these cows was positive 76.4 kg to a megative 
30.9 kg fat. So. without adjustment for maternal lineage effects these cows as bull mothers would have had 
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76.4 kg more and 30.9 kg less fat included in their phenotypic estimates. Thus, bull mothers would have 
been over and under estimated, respectively. 

The dam to cow pathway has traditionally been selected least intensely. However, new developments in 
reproductive technology and embryo nnanipulatlon seem poised to make this pathway of selection more 

5 viable. Differences in mtDNA could be incorporated into embryo transfer breeding programs to better 
choose donor and recipient females to produce replacement heifers. Current cloning techniques require 
nuclear transplantation into an enucleated ovum without regard to cytoplasmic content. Potential exists for 
using mtDNA sequence polymorphism to identify ova of females with inferior nuclear genetics in superior 
mtDNA background as candidates for enucleation and a subsequent introduction of nuclei with greater 

10 genetic potential. 

Table 4 



25 



location, type, and frequency of seventeen most common sequence polymorphisms of mtDNA D-toops in 






a herd of dairy cattle 






Location in 


polymorphic 


Frequency 


Location in 


Polymorphic event 


Frequency 


D-Loop* 


event 




D-Loop* 






8 


G-A 


.07 


16058 


C-T 


.12 


106 


T-C 


.14 


16074 


T-C 


.07 


169 


A-G 


.80 


16085 


T-C 


.05 


216 


Var. length 


.84 


16111 


A-C* 


.04 




G-C run 




16113 


T-C 


.11 


363 


C-G- 


.46 


16141 


T-C 


.11 


16022 


C-A 


.14 


16230 


C-T 


.06 


16049 


C-T 


.08 • 


16231 


C-T 


.12 


16057 


G-A 


.12 


16247 


C-T 


.13 



a Location is defined by the first published mtDNA sequence (Anderson et al., 1982). Polymorphic 
30 event and frequency is also with regard to that sequence. 
' Base pair substitution is a transversion. 
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' TABLE 5 
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Regression of production on sequence polymorphisms in the mtDNA D-Loop and overall production 






means in a herd of dairy cattle 








Location in 


Milk (kg) 


Fat (kg) 


SNF (kg) 


Lactation 


Fat (%) 


.Snf (%) 


Energy 


D-Loop 








Energy (Meal) 






Concentration 
(Kcal/kg) 


Q 
O 




1 


19 


85 


-.fc 1 






106 


464 


16 


47 


343 


.uo 


nfl 
.uo 


0 


1RQ 


to^ 


24- 


51' 


456* 


.uo 


.uo 


D 


216 


-157 


1 


-7 


- 10 


.07 


.09^ 


10 


16022 


-113 


6 


-26 


-162 


.04 


.01 


3 


16049 


9i39* 


29 


102+ 


756 


-.07 


.17 


1 


16057 


.577+ 


-21 


-48 


-399 


.01 


.11 


7 


16058 


39 


20 


9 


225 


.30" 


.06 


31 + 


16074 


842' 


37- 


85" 


749" 


.14 


.05 


16 


16085 


-197 


-20 


-36 


-343 


-.21 + 


-.19+ 


-29+ 


16111 


107 


-5 


15 


52 


-.05 


.05 


-3 


16113 


32 


3 


-10 


3 


.09 


-.21 + 


-3 


16141 


-336 


-8 


-35 


-198 


.10 


-.05 


6 


16230 


383 


33 


39 


490 


.39- 


.06 


39- 


16231 


-650+ 


-28 


-61 + 


-522+ 


-.12 


-.01 


-12 


16247 


351 


21 


32 


346 


.17* 


-.01 


16+ 


Overall Mean^ 


8085 (1771) 


288 (60) 


745 (165) 


5888 (1225) 


3.63 (.44) 


9.21 (.40) 


732 (50) 



^ Overall standard deviations are in parentheses. 



*P> t^ .01. 
'P> t'^ .05 
30 +P>"ti.10. 
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TABLE 6 



Regression of reproduction and health costs on sequence polymorphisms in the mtDNA OLoop and 
overall means of reproduction and health cost means in a herd of dairy cattle 


Location in 
D-Loop 


Days open 
(d) 


Number of 

Breedings (n) 


Reproduction 
Costs ($) 


Mammry Costs 
($) 


Total Health Costs 
($) 


8 


-31.7 


-.73 


-6.12 


24.79* 


10.22 


106 


0.5 


.13 


-2.66 


16.24* 


21.28* 


169 


14.6+ 


.34 


6.05* 


.84 


11.47 


216 


-0.5 


.12 


.07 


-9.70* 


-14.54* 


16022 


-13.4 


-.12 


-3.89 


9.77 


1.83 


16049 


16.1 


.28 


.88 


4.58 


17.94 


16057 


9.7 


.14 


-2.90 


-9.31 


-15.93 


16058 


28.3* 


.68 


2.44 


2.94 


18.44 


16074 


3.7 


.34 


6.28 


13.56 


25.93* 


16085 


-36.3* 


-.99* 


-12.82* 


-2.31 


19.70 


16111 


-14.1 


-.64 


-4.06 


-8.94 


-11.19 


16113 


-5.5 


-.63 


-1.64 


1.18 


-2.24 


16141 


-11.7 


-.36 


-3.01 


-15.67+ 


-19.58 


16230 


-4.4 


-.02 


-2.04 


13.10 


12.12 


.16231 


-16.0 


-.53 


-3.53 


-4.81 


-11.30 


16247 


-8.2 


.02 


2.47 


3.19 


7.71 


Overall mean^ 


135 (72) 


2.61 (2.24) 


38.73 (34.49) 


19.07 (43.8) 


77.67 (60.14) 
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a Overall standard deviations are in paranthases. 
' P > t ^ .05. 
* P>'tS .10. 
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Mitochondrial DNA (mtDNA) displacement-loop (D-Loop sequence polymorphism information from 36 
maternal lineages was evaluated. Of 17 base pair substitutions evaluated, several were significantly 
associated with milk, fat, and solids-not-fat-production. Another marked a large impact on fat percentage 
and net energy concentration. Positive and negative effects on alt production traits were observed. One 
base pair substitution was related to a large favorable decrease in days open, number of breedings, and 
reproductive costs. 

Maternal lineage groups defined by several methods of classification using mtDNA sequence char- 
acteristics were evaluated with animal models. Groups defined as those maternal lineages with or without 
base pair substitution at nucleotide 169 accounted for increased milk fat and estimated milk energy 
production. Clustering the 36 maternal lineages using 17 mtDNA D-loop sequence differences produced 
groups with significant effect on fat percentage and energy concentration. 

This method allows one to employ analysis of the mitochondrial DNA to predict which animals differ in 
traits of economic importance. Thus, for example a single Adenine to Guanine transition at bp 169 relates to 
increased milk production as measured by volume and lowered milk fat; polymorphism at 16074 has a large 
positive effect on milk, fat and SNF; polymorphism at bp 16058 is evidence of increased days open and 
reproductive costs. This provides a new source of genetic material to use in evaluating inheritance in an 
animal. Mitochondrial influences on a dam's production are not transmitted through her male offspring to his 
progeny, and such cytoplasmic effects currently are included in an estimate of the dam's transmitting 
ability, which may cause the dam's transmitting ability to be over- or under-estimated for the purpose of 
predicting her son's progeny test. Knowledge of the dam's maternal effect allows adjusting her transmitting 
ability to accurately predict her son's progeny test. This method provides a basis for exploitation of potential 
gains from selection from more efficient maternal genotypes based upon cytoplasmic differences. It is also 
possible to identify cows with good maternal traits as herd replacements. Even small differences in feed 
efficiency can be exploited and would have major economic importance when applied to the dairy cattle 
population. By way of another example, feed costs are 40 to 50% of dairy production expenses, and even if 
only a 5% increase in feed efficiency resulted from application of mitochondrial DNA to selective practices, 
a savings of over 3 billion dollars annually could be realized by the dairy industry in the United States. 

Other potential applications include tiie possibility of combining and transplanting superior mitochondrial 
DNA with superior nucleic DNA. resulting in animals which predictably will carry an inheritable desired trait, 



11 



EP 0 566 793 A1 



such as high milk production. 

Another application is choice of cows from the best maternal lineages (cytoplasmically or mitochon- 
drically based), as donors for embryo transplants. The practice of embryo transplantation is used exten- 
sively in the dairy industry. 
5 Thus, it is apparent that the invention accomplishes at least all of its objectives. 

Claims 

1. A method of determining the phenotypic effects of mitochondrial DNA polymorphism in dairy cattle 
10 comprising (a) detecting said polymorphism in mitochondrial DNA of said animals; and (b) associating 

nucleotide sequences of mitochondrial DNA to traits of economic importance. 

2. A method as claimed in claim l, comprising associating nucleotide sequences of mitochondrial DNA to 
milk production traits. 

75 

3. A method as claimed in claim 2, characterised in that polymorphisms at site 16074 and at sites 16058, 
16085, 16230 and 16247 are associated with milk production. 

4. A method as claimed in claim 1 , further comprising fragmenting and sequencing mitochondrial DNA of 
20 said animals. 

5. A method as claimed in claim 1, characterised in that polymorphism at bp 169 and 16058 are 
associated with days open and reproductive costs. 

2S 6. A method as claimed in claim 1 , characterised in that polymorphism within the mitochondrial DNA is 
the D-loop sequence. 

7. A method as claimed in claim 4. further comprising comparing said sequences of mitochondrial DNA of 
said animals to sequences of mitochondrial DNA of another animal to detect polymorphisms. 

30 

8. A method as claimed in claim 7, characterised in that phenotypic expression of a trait is correlated to 
said polymorphism in mitochondrial DNA. 

9. A method of genetic evaluation of dairy cattle comprising sequencing mitochondrial DNA fragments of 
35 said dairy cattle and associating the expression or absence of a milk production traits in said dairy 

cattle with the presence or absence of said mitochondrial DNA fragments. 

10. A method of evaluating inheritable milk production traits and health in dairy cattle by partitioning effects 
of mitochondrial lineages from nuclear effects, comprising isolating mitochondrial DNA from a group of 

40 said dairy cattle, fragmenting and sequencing said mitochondrial DNA. comparing said mitochondrial 
DNA of said group of dairy cattle to detect nucleotide sequence polymorphism and correlating said 
polymorphic mitochondrial DNA sequences with a milk production trait of said dairy cattle. 
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